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COMPOSITIONS AND METHODS FOR PROTEIN 
PURIFICATION BASED ON A METAL ION AFFINITY SETE 

5 



10 

BACKGROUND OF THE INVENTION 

Cross Refer ence to RejflT«>H Applications 

This appUcation claims benefit of non-provisional 
15 appUcation US Serial number 09/078.687 filed May 14, 1998. 

pi«»lH nf the Invention 

This invention relates generally to the field of protein 
chemistry. SpecificaUy, the present invention relates to protein 
20 purification using a metal ion affinity site. 

packground of the. Invention: 

Development of protocols for the isolation and 
purification of proteins is often a long and costly process. Such 
25 protocols usually contain multiple steps, where some of the steps 
have recoveries as low as 50%. Further, due to variation between 
protein molecules, a purification protocol developed and effective 
for Oie purification of one protein is not necessarily useful for the 
purification of anotiier. In fact, m most cases considerable 
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adaptations must be made to a purification protocol to 
accommodate the various physical and chemical characteristics of 

different proteins. 

The ability to prepare hybrid genes by genetic 
5 engineering technology has opened up new possibilities for the 
purification of proteins. For example, one can Unk a DNA 
sequence of a protein of interest to a nucleic acid sequence which 
codes for a peptide which has a high binding affmity for a specific 
Ugand. The fusion protein product resulting from expression of 
10 this DNA has attributes of both the protein of interest and the 
high affmity peptide. To purify or immobilize the engineered 
fusion protein, the ligand commonly is linked to a support, and the 
unpurified, engineered protein is then exposed to the 
Ugand/support composite and allowed to bind. 
15 There are numerous advantages of using a high 

affinity fusion protein. For example, the use of an affinity peptide 
ensures that no part of the native protein of interest is involved in 
adsorption-the binding between the fusion protein and the ligand. 
At the same time, extremely high selectivity in the adsorption 

20 process is achieved. 

Immobilized Metal Ion Affinity Chromatography 
(IMAC) is one of the most frequently used techniques for 
purification of fusion proteins containing affmity sites for metal 
ions. Proper choice of immobiUzed metal ion, loading conditions 
25 and elution conditions can result in protein purification of up to 
about 95-98% in a single chromatographic step. Moreover, 
recovery generally is higher than 85%. lu addition to the 
advantages discussed above, incorporation of a proteolytic, 
chemical, or enzymatic cleavage site into the composite DNA, 
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between the affinity peptide and the sequence of the protein of 
interest, provides a means for cleaving the affinity peptide from 
the protein of interest to yield the native protein of interest in 

highly purified form. 

5 The following publications are representative of the 

art: Itakura. et al.. Science 198:1056-63 (1977); Germino, et al., 
PNAS USA 80:6848-52 (1983); Nilsson et al., Nucleic Acid Res. 
131151-62 (1985); Smith et al.. Gene 32:321-27 (1984); DobeU, et 
al., U.S. Pat. No. 5,284,933; and DobeU. et al., U.S. Pat. No. 

10 5.310.663. 

The prior art is deficieint in improved compositions 
and metiiods for affinity immobilization and purification of 
proteins. This invention fulfills this long-felt need in tiie art. 
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SUMMARY OF THE INVENTION 



20 



25 



The present invention relates to compositions and 
metiiods for protein purification involving the use of novel, 
genetically-engineered fusion proteins. These fusion proteins are 
engineered to allow for immobiUzation and purification via the 
high affinity interaction of an affinity peptide of a fusion protein 
witii a Ugand. The affinity peptide is a histidine-rich polypeptide 
sequence witii a general sequence: (HX„)„. wherein H is histidine. 
X is an amino acid otiier tiian histidine. n= 1-8, m= 2-30, and 
wherein if n=l for more than two adjacent units of HX. at least one 
X must be asparagine. phenylalanine, tiyptophan. tyrosine, lysine, 
methionine, arginine, glutamine. or cysteine. The affinity peptide 
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U U,*ed u, tt.e pro^ins of .oteres, Rl «.d R2 u. yield . fusion 
p„teia wi* formula R1-(HX.)..R2. la a preferred embodiment of 
fl,o invention. n=l-4 and m=2-10. In a more preferred 
embodiment of .he iovenUon. n=l-4 and m=3-6. In a spec.f.c 
5 embodiment of .he invention, a fnsion pro»in having , e 
„e SLKDHUHNVHKEEHAHAHNHSWGVGAVGM (SEQ ID 
No-6) is provided. In anoti.er embodiment of U.e present aspect 
0, a,e invention, at leas, one pro«ase cleavage si« is inserted 
between tiie sequence of U.e pro«in of in.eres. «.d U,e seqnence 

10 of ttie affinity peptide. 

to anotiier aspec. of U,e invention. Acre is provided a 
DNA sequence coding for a fusion protein comprising a pro«m of 
i„«„s. fused a. its amino-terminus or carboxy-terminus «. a. 
leas, one affini.y peptide, where U» fusion protein has the 
,5 genera, formula R.-(HX„,.-RZ whe«in Rl or R2 is ti>e protem of 
interest H is histidine. Xis an amino acid odier ti«n histidine. n= 
,^ 2-30, and wherein if n=l for more U.a. two adjacent 
u„i« of HX. a. leas, one X must be asparagine. phenylalanrne. 
.rypu,phan. tyrosine, lysine, metiuonine. arginine. glummme. or 
30 cys.ei.e. In a specific embodimen. o, tins .spec, of *e inven..on. 
Uiere is provided a DNA sequence which codes for a p«..em where 
U.e fusion pro^in has *e sequence 

SLKDHLIHNVHKEEHAHAHNiaSWOVGAVGM(SEQIDNo«. 

to various embodiment of to .spec, of .he 
„ invention. U,ere is provided a recombinan. vector comprising an 
expr^sion vector and a DNA sequence coding for a fus.on pro.e.n 
comprising a protein of interest fused a. i« amino-terminus or 
earboxy-rerminus » a. leas, one affinity peptide as descr.be 
above, wherein U.e recombinan. vec.or is capable of d.recng 



PCTAJS99/10662 

W0 99/S7992 

expression of U.e DNA sequence to . s«iuU>le host organism. The 
present tovention also provides a host organism conuining a 
recombinant expression vecu,r comprising a DNA sequence codtog 
for a fusion protein comprising a protein of toterest fused at us 
5 amino-terminns or carboxy-terminns to at least one affintty 
pepdde as described above, whereto .he organism is capable of 

expresstog said DNA sequence. 

to yet an additional aspect of this invention, there is 
provided a method for purifying the novel fusion proteins of the 
,0 present tov«.Uon. comprising the steps of: contacUng a protc.n 
.ample containing the fusion proton to a mix«.re wid. other 
proteins wiU, a metal cheUte resta under condiUons where the 
fusion protein binds to resin to produce a resto-fusion pro.e.n 
complex; washing U,e resto-fusion proteto complex » ^J-"" 

,5 to remove the other, unbound proteins; ^ eluting d.. bound 
toion proteto from the washed resin-fusion proteto complex. One 
embodiment of dus method tocludes inserting a. leas, one 
protease cleavage site between «.e proteto of interest and he 
affinity peptide, and cleaving Ute proteto of toterest from the 
^ affinity peptide alter purification using ti.e metal chelae resm. 

Otter and furtiter asp«=ts. embodiments. te««res and 
advantages of the present invention win be apparent from the 
foUowing description of the invention. 
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BRIEF DESCRIPTION OF TDE HGUBES 

Figure 1 is a schematic represenuUon of the 
pGFPuv/HAT vector. This vector conuins one embodiment of the 
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.ffl^tv pepUd. of *e pres... i-nUon fused to a,e N-u™i»us 

restriction sites are noted. 

Figure 2 is a schematic representation of the vector 

. . the affinity peptide (AP) at the N- 

s DUC19/HS) containing part of the alimuy p p ^ , . . 

5 puuiv/n ; cleavage site followed by multiple 

terminus of Enterokinase (EK) cleavage 

cloning site (MCS). Only restriction sites are denoted 

Figure 3 is a schematic represenUUon of the 
restriction maps of a vector with three frame shifts c— 
,0 part of the affinity peptide and enterolcinase cleavage site that 
used for expression of recombinant proteins. 

DETAILED DESCRIFnON OF THE INVENTION 

ti«T, relates to compositions and 
The present invention relates 

„eU,o-s fo. puHficaaon of nov.. -^^J-^ /"J';;. 

IfflBit, i.«ractioo of an afBm.y peptide pomon of U.e fus.o 
M pro.«n wiU, a Ugand. The lugb .ff""ty P * 

~,lypeptide wi± a general sequence (HX„)„ wlucn 

„, in^rest Rl o, R2; therein H is histidine. Xis an am.no 
p„u=ms of interest Rl • 

.cid other than hisUdine, n= 1-8. n.- 2 3U, a 
„„re than two adjacent units of HX. at least one X ..^ he 
. .p..gine. Phen,u.anine. ..Ptopha. ^ 
^eU-ionine. argmine. glutaniiu.. « '"7;^ ^.^ „ 

is incorporated ---'J ^„ 
sequence of a protem of interest ana 
purification of the fusion protein. 
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The affiBity of the high affinity peptide is for 
.„„ohiU»d -.eta. io.». The strengU. of binding be«,eenj.e hig" 
afflni^ pep.de and an appropHaU tneta. .on » ven, h.^. t^u . 
isoUdon of .he fusion protein, is very sele^ve. Howev«, 

.. »nd the Usand is also reversible, 

association between the pept.de and the ug 

once .he fnsio. protein has been aUo«ed to .ssoc«.u or adsorb 
^U. «.e n.eu. ion ligand, U,e prorein can be -i'— 
..^ from *e «.e»l ion/adsorbent b, addiUon of coo.pe ... e 
such as inadazole. or b, decreasing U,e pa wbrch lea.. 

L chain and release of a.e adsorbed pro.in Because of .b.s 
.versibiU.,. *e pr„.ein is recovered in a purified. >';^;»^^«- 
FurOrer. regeneraUon and reuse of U.e »eU. .on/adsorben or 
.„^r. nrumple a»es-even .ore than .00 — P"^^''^,^^ 
An additional feamre of fte protein purrfication and 
in.n.obiU«.tion «chni,ues based on d.e principles of the present 
TenUon is the high probabiUt, tha. the puHfied and regenera 

. ■ of inures. wiU re«.in foU biological acuv..y and 
relci; rrbecause U.e affini. peptide is involved in .he 
, il—ion^ioding process where U.e portion of d-e fus.on 
protein a.a. contains die proKin of interest is not 
' incorporaaon o, a proteolytic site be.«.en ti» h.gh 

peptide and U.e se,uence of d,e prorein of in-»res. 
d.e means te regenerate a.e protein of .n.eres. f«,n. tho 
, Lou protein. Regeneration is achieved b, limited prceo^s 
«.e fuL protein and a s»ond chron.«.graphy step .n v,h.c .he 
::teol,.ic produc. is passed Uirough an — ^jne^ .« 

, Indeed one can utilize the same column as was 

affinitv column. Inaeea. one 

1 immobiU« and purify d.e fusion protein. I. U.e second 
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c.ro.atog.aphy s.p following proteolysis, cleaved 

regener p . whereas the high affinity peptide is 

immobilization or adsorpuon, wnercd 

adsorbed on the column. f,„tnres 

e tircxsent invenUon features 
, One embodiment of the present 

i.c.rpo.aU0. Of nucleic acid se,«e„ces wMch code for secreUon 
J» inu. *e DNA se<,„e„ce U.a. codes for «.e fu«on p o.«. 
:L cause U,e ft.s>.a pro.ei. .o be sec«^ n» 

0,e .edia after syo«.esis in a host ceU. Since a cons.der Me 
,0 lr„. of .ou. ceUnlar pro«i. regains in U.e ceU. secreuon 
Zves dra.a.cal,y U.e isoUaon and p«rir.ca.on of *e fns.on 
:L by e,i.inaU„g U,e need for ceil disrupUon. protern 
elcuon. and/or ren.ovai of „nw«,t^ ceilular cocponents and 

nucleic acid. , "Kicrh 

AS used herein, the terms "affinity peptide or high 

HH." refer to a histidine-rich polypeptide with a 
affinity pepude refer to a „f i^t^rest Rl 

fHX ^ which is Unked to proteins of interest Kl 
eeneral sequence (rtAaJm wmwu 

„ • u ctj^iin^ X is an amino acid other than 
or R2- wherein H is histidme, A is an 

. ism- 2-30 and wherein if n=l for more Aan two 

of HX, at least one X must be asparagine, 
90 adjacent units of HA, ai 

. tyrosine lysine, methionine, argmine, 

phenylalanine, tryptophan, lyrosme, ly 

glutamine, or cysteine. 

As used herein, the term "protein of interest shall 

25 purpose of purification or immobilization. 

AS used herein, the term "fusion protem shall refer 

affinitv peptide and the protein 
U,e protein hybrid contarmng the afftn.., P P 

of interest or any amino acid seqnence of tnte^st. 

protein has the genera, (ormuia RHHX.)..^-. whe^n Rl or R2 
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T, • i,;ctiHitiP X is an amino acid other than 
the protein of interest, H is histidme. X is an 

1 R m- 2-30 and wherein if n=l for more than two 
histidine, n= -8, ^ ^^^^ ^^^^^^^^^^ 

adiacent units 01 xlA, ai 

"secrcdon signal sequence" shaU refer » an an^o a^d signa^ 

^^a. s«r.ence o«»ide *e ceU membrane. In U-e present 
r l«in of U.e present invenUon may contam snch a 
0 fusron pr..«n P purification, 

secretion sequence to enhance 

Kepresentaave examples of secretion signal setptences are 
known to a.ose having ordinary skiU in this art 

AS used herein, the «=rm "proteolync stte shaU refer 
.3 u, any amino acid sequence recognized by any proteolytic enzyme^ 
I Z present case, a fusion protein of the present inventron may 
H'such a proteolytic site between the protein of interes an 
.he afrmlty peptide an.or other ^no ^ ^ 
protein of interest may be separated easUy 
20 heterologous amino acid sequences. 

As used herein, the term "metal .on refers to 
^eta. ion for which the affinity peptide has affinity »d that can 
^ used for puriflcauon or immobiUzation of a fuston p otern. 

such metal ions include Ni*^. Co^^. Zn*^- Cu^^- ^ ^^^^^ 

AS used herein, the terms "adsorbent or sohd 

support" Shall refer to a chromatography or immobilizauon 

medium used to immobilize a metal ion. 

AS used herein, the term "regeneration", m the context 

of «,e fusion protein, shall refer to the process of separating or 
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eUminating the affinity peptide and other heterologous amino acid 
sequences from the fusion protein to render the protein of 

interest after purification. 

So that the matter in which the above-recited features, 
advantages, and objects of the invention become clear and can be 
understood in detail, particular descriptions of the invention may 
be had by reference to particular embodiments described in the 
Examples below; however, the following description and examples - 
are given for the purpose of illustrating various, specific 
embodiments of the invention, and are not meant to limit the 
scope of the invention in any fashion. 



EXAMPLE 1 



nT, ^..tinn nf 1. — A.^^^r.o...rr frnm rhirken breast muscle 

A naturally-occurring peptide sequence from the N- 
terminus of lactate dehydrogenase (LDH) from chicken muscle 
{Gallus gallus) was used for initial experiments. The protein 
includes a stretch of approximately 30 amino acids which has a 
sequence consistent with the general formula of the fusion protein 
of the present invention 

(SLKDHLIHNVHKEEHAHAHNKISWGVGAVGM (SEQ ID No:6)). 
Further. LDH has the feature that the enzyme itself can be assayed 
5 easily for activity. Thus, the naturally-occurring chicken muscle 
LDH served as a "fusion protein" for these experiments in the 
sense that it contained both a high affinity peptide and a protein 
of interest. 
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Extracaon of chicken breast muscle IDH was 
performed by cuttmg 15 g of frozen chicken muscle, free of blood 
vessels, inu, smaU pieces and ttansferring fte material to a 
commercial blende, along with 150 mL of extraction buffer (50 
5 ,aM sodinm phospha«=. 1 mM EDTA. 1 mM magnesium acetate pH 
75 1 mM 2-mercap.oethanol (0.2 L) stored for at least 30 
mnutes at 4-C). The mixture »as homogenized twice at 4'C for 30 
seconds, with a 10-minn.e pause between the bursts. After the 
second homog^izatio.. the mixture was transferred to centrifuge 
,0 tubes «.d centtifuged at 4»C and 10.000 x g for 30 minutes. The 
clear supernatant was collected «k1 used as a starting sample for 
the purificatton of lacote dehydrogenase. 



EXAMPLE 2 



Lactate dehydrogenase was punfied by MAC m the 
» following mamren approximately 5 mL of Chelating Sepharose 
(Amersham. Pharmacia) was .r««ferred to a vacuum bottle 
diluted with an equal volume of deionized water and degassed 
under vacuum for 10 minutes. The gel suspension was poured 
into a column (10x1 cm. i. d.) trapped on the bot«>m with a 
25 degassed adapter and left to seme. The column was then fiUed to 
^ top with degassed deioniz«i water, and a top adapter was 
gently pushed down the column bed until there was no space 
between the top surface of d.e gel and U.e adapter. The column 
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was washed with 3 column volumes of deionized water at a flow 

rate of 0.5 mL per min. 

The chicken muscle extract (14 mL) was equilibrated 
by gel filtration on Sephadex G-25 columns with equilibration 
5 buffer (20 mM sodium phosphate buffer containing 1.0 M sodium 
chloride and 0.06 M imidazole pH 7.0 (1 D). The IMAC column 
was then charged with Ni(ID ions using 20 mL of a 0.02M 
Ni(N03)2 solution. The excess metal was washed from the column 
with deionized water at a flow rate of 0.5 mL per minute and the 
10 column was then equilibrated with 5-10 volumes of equilibration 
buffer (20 mM sodium phosphate buffer containing 1.0 M sodium 
chloride and 0.06 M imidazole pH 7.0 (1 L)). 

The IMAC column was prepared by loading the 
equilibrated extract on to the IMAC column at a flow rate of 0.5 
15 mL per min. Fractions of 1 mL were collected. The colwnn was 
washed with equiUbration buffer until a baseUne was reached 
(absorbance of the fractions at 280 mn was less than 2 mAU 
higher than the absorbance of the equilibration buffer). The 
adsorbed material was eluted with elution buffer (20 mM sodium 
20 phosphate buffer containing 1.0 M sodium chloride and 0.3 M 
imidazole pH 7.0 (0.2 L)) and absorbance at 280 mn was 
determined on a spectrophotometer. Protein content of each 
fraction was determined as described in M. Bradford, Analytical 
Biochemistry. 72 (1976) 248, and lactate dehydrogenase activity 
25 was determined as described in F. Kubowitz and P. Ott, Biochem. Z. 
314 (1943) 94. Results indicated that more tiian 95% of the 
lactate dehydrogenase activity was recovered in the elution 
fractions. 
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EXAMPLE 3 

r^arant ^riration of lartate dehvdropenase bindjng 
5 Further experiments were performed both with native 

LDH, a tetramer of about 140 kD, and a subunit of the enzyme, 
obtained after warming of the crude chicken muscle extract to 
45°C for 10 minutes. Both the tetramer and the subunit were 
allowed to associate with the immobiUzed Ni support, and both 
10 forms of LDH were retained. This result demonstrates that the 
retention of the LDH enzyme on immobilized Ni is not peculiar to 
the tetrameric form of the peptide; that is, binding does not 
require "cooperation" between subunits. Instead, the single 
subunit of the enzyme also had affinity for the nickel ion, and this 
15 affinity was demonstrated to be virtually identical to the affinity 
shown for the tetramer. Both the native protein and the subunit 
were adsorbed in buffer with an imidazole concentration up to 60 
mmol and both were eluted completely at a concentration of 300 
nunol imidazole. 

20 To ascertain that it is the polyhistidine portion of the 

LDH that provides affinity for the nickel ion, the tetramic form of 
the LDH enzyme was subjected to CNBr cleavage to produce a 
mixture of peptides. This mixture of peptides was appUed to a Ni- 
IDA column with metal ion capacity of 32 mmol per mL gel. 

25 Loading conditions were the same as those used for the 
purification of the enzyme from the crude extract, described 
above. The adsorbed material was eluted witii 300 mmol 
imidazole, and subjected to RPC chromatography. The 
chromatographic peak containing about 80% of tiie adsorbed 
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material was then subjected to amino acid analysis. The results 
obtained demonstrate that this peak corresponds to the N- 
terminal peptide from LDH and that this peptide that contains the 
polyhistidine sequence. In addition, the fact that the peptide 
retained its binding affinity even after treatment with CNBr in 
presence of 70% TFA is proof that the binding is not due to a rigid 
secondary conformation structure. 



JO EXAMPLE 4 

p^^rf^.,.;».^ nf Inrfate c ^ f ^yHmyBtiase on ro?+-TAT,ON agarQ?e 

Extraction of chicken breast muscle LDH was 
performed as in Example 1. and equilibrated by gel filtration on 
15 Sephadex G-25 columns wifli equilibration buffer (20 mM sodium 
phosphate buffer containing 1.0 M sodium chloride and 0.06 M 

imidazole pH 7.0 (1 L)). 

The IMAC column was prepared in the following 
mamier: Approximately 2.75 mL of Co2+-TAL0N Superflow 6 

20 (Amersham, Pharmacia) was transferred to a vacuum bottle, 
diluted witii tiie same volume of deionized water and degassed 
under vacuum for 10 minutes. The gel suspension was poured 
into a column (3x1 cm. i.d.) trapped on Uie bottom wifli a 
degassed adapter and left to settie. The column was fiUed to the 

25 top witii degassed deionized water, and a top adapter was gently 
pushed down toward tiie column bed until tiiere was no space 
between tiie top surface of Uie gel and tiie adapter. The column 
was washed witii 3 column volumes of deionized water at a flow 
rate of 0.5 mL par min. 
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Purification of the fusion protein on Co2+-TALON 
Superflow 6 was carried out by first equilibrating the IMAC 
column with 5 to 10 column volumes of the equilibration buffer. 
The sample was then loaded on the IMAC column at a flow rate of 

5 1.0 mL per min, and 1 mL fractions were collected. The column 
was washed with the equilibration buffer until a baseline was 
reached (absorbance of the fractions at 280 nm as less than 2 
mAU higher than the absorbance of flie equilibration buffer). 

The adsorbed material was eluted with elution buffer 

10 (20 mM sodium phosphate buffer containing 1.0 M sodium 
chloride and 0.3 M imidazole pH 7.0 (0.2 L)) and absorbance at 
280 nm was determined on a spectrophotometer. Protein content 
of each fraction was determined as described in M. Bradford, 
Analytical Biochemistry, 72 (1976) 248, and lactate 

15 dehydrogenase activity was determined as described in F. 
Kubowitz and P. Ott, Biochem. Z., 314 (1943) 94. As in Example 2, 
more than 95% of the lactate dehydrogenase activity was 
recovered in the elution fractions. 



20 

EXAMPLE 5 

Tsolarinn and purification o f fusion prntein consisting of affinity 
ppptiHp. and Green Fliinrescen T Protein UV Mutant (GFPnv) 
25 An affinity peptide/GFP fusion protein was isolated 

from E. coli cells which had been transformed with the pGFPuv.HS 
vector. Cell paste (0.39 g) was transferred to pre-cooled mortar, 
1.2 g of alumina was added, and the mixture was ground for 2 
minutes. Extraction buffer (5 mL, stored at 4*0 was added, and. 
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after additional grinding for 2 minutes, the mixture was 
transferred into four eppendorph tubes. The suspension was 
added to the eppendorph tubes and centrifuged for 12 minutes at 
12,000 rpm (11,750 x g). The clear supernatant (approximately 6 

5 mL) was used as a starting sample for IMAC. 

The extraction and chromatography equilibration 
buffers consisted of 20 mM sodium phosphate buffer containing 
1.0 M sodium chloride and 5 mM imidazole pH 7.0 (1 L). The 
elution buffer for IMAC consisted of 20 mM sodium phosphate 

10 buffer containing 1.0 M sodium chloride and 150 mM imidazole 

pH 7.0 (0.2 L). 

The IMAC was carried out in the following manner: 
Approximately 2.75 mL of Co2+-TALON Superflow 6 (Amersham, 
Pharmacia) was transferred to a vacuum bottle, diluted with the 

15 same volume of deionized water and degassed under vacuum for 
10 minutes. The gel suspension was poured into a column (3x1 
cm. i.d.) trapped on the bottom with a degassed adapter and left 
to settle. The column was filled to the top with degassed 
deionized water, and a top adapter was gently pushed down 

20 toward the column bed until there was no space between the top 
surface of the gel and the adapter. The column was washed with 
3 column volumes of deionized water at a flow rate of 0.5 mL per 
min. 

Purification of the fusion protein on Co2+-TALON 
25 Superflow 6 was carried out by first equilibrating the IMAC 
column with 5 to 10 column volumes of the equilibration buffer. 
The sample was then loaded on the IMAC column at a flow rate of 
1.0 mL per min, and 1 mL fractions were collected. The column 
was washed with the equilibration buffer until a baseline was 
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reached (absorbance of the fractions at 280 nm as less than 2 
mAU higher than the absorbance of the equilibration buffer). The 
adsorbed material was then eluted with elution buffer. 

Absorbance of each fraction at 280 nm was 

5 determined on a spectrophotometer; and protein content of each 
fraction was determined. Fluorescence of each fraction was 
determined on a microplate reader, and the purity of the fusion 
protein was determined also by SDS-electrophoresis. More than 
85% of the fusion protein was recovered in the fractions obtained. 

10 Part of the cDNA sequence, and the amino acid sequence encoded 
by this cDNA sequence, of a vector containing the affinity peptide 
at the N-terminus of Green Huoresecent Protein-UV mutant 
(GFPuv) is shown in SEQID No. 1 and SEQ ID No. 2, respectively. 
The full cDNA sequence of a vector containing the construct of the 

15 affinity peptide at the N-terminus of GFPuv is shown in SEQ ID No. 
3. The full cDNA sequence of a vector containing part of the 
affinity peptide at the N-terminus of the enterokinase cleavage 
site and the amino acid sequence encoded by this d5NA 
corresponding to the start of translation site, the affinity peptide 

20 and the multiple cloning site are shown in SEQ ID Nos. 4 and 5. 

EXAMPLE 6 

25 Construction of fus ion proteins 

A DNA sequence corresponding to the affinity peptide 
of the present invention is fused to the DNA coding sequence of a 
protein of interest. The polynucleotide sequence for the affinity 
peptide is fused most generally at or close to the DNA sequence 
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coding for the N- or C-terminal amino acid of the protein of 
interest. This results in a DNA sequence which codes for a fusion 
protein comprising the affinity peptide and the protein of interest. 

In addition, a polynucleotide sequence that codes for a 

5 protein proteolytic site is incorporated into the fusion protein DNA 
sequence between the sequence for the affinity peptide and the 
sequence of the protein of interest. This type of DNA construct 
results in a fusion protein product having a proteolytic site. This 
site allows for the eventual regeneration of the protein of interest 

10 from die fusion protein by limited proteolysis and a second 
chromatography step. The second chromatography step, in which 
tiie product of die proteolysis is loaded onto an immobilized metal 
ion affinity column, results in die separation of the protein of 
interest from the affinity peptide. 

15 An additional embodiment of the present invention 

provides a DNA sequence coding for a polypeptide "secretion 
signal" introduced into the DNA that codes for the fusion protein. 
This secretion signal, when expressed, causes die fusion protein to 
be secreted into the culture media after the fusion protein is 

20 syntiiesized in tiie ceU. Since a large number of cellular proteins 
are not transported out of the cell, isolation and purification of the 
fusion protein is enhanced as the requirements for cell disruption, 
extraction and removal of unwanted cell components are 
eliminated. 

25 The present invention is directed to a fusion protein of 

general formula Rl-(HX„)m-R2 comprising: a protein of interest Rl 
or R2 fused at its amino terminus or carboxy terminus to at least 
one affinity peptide, said affinity peptide having a formula (HXn)ii,. 
wherein H is histidine, X is an amino acid other flian histidine, n= 
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1-8, m= 2-30, and wherein if n=l for more than two adjacent 
units of HX. at least one X must be asparagine, phenylalanine, 
tryptophan, tyrosine, lysine, methionine, arginine, glutamine, or 
cysteine. Preferably, n=l-4 and m=3-10. In one preferred 

5 embodiment, n=l-4 and m=3-6. Preferably, if n=l for more than 
two adjacent units of HX, only one Xis asparagine, phenylalanine, 
tryptophan, tyrosine, lysine, methionine, arginine, glutamine, or 
cysteine. In one preferred embodiment, the fusion protein has a 
sequence SEQ ID No. 1. The fusion protein may contain at least 

10 one protease cleavage site between said protein of interest and 
said affinity peptide. Preferably, the affinity peptide has affinity 
for metal ions. A representative metal ion is a nickel ion. The 
fusion protein may fiirther comprise a secretion signal sequence. 

The present invention is also directed to a I»tA 

15 sequence coding for a fusion protein of general formula Rl- 
(HXn)m-R2 comprising: a protein of interest Rl or R2 fused at its 
amino terminus or carboxy terminus to at least one affinity 
peptide, said affinity peptide having a formula (HXn)m. wherein H 
is histidine, X is an amino acid other than histidine, n= 1-8, m= 2- 

20 30, and wherein if n=l for more than two adjacent units of HX,at 
least one X must be asparagine, phenylalanine, tryptophan, 
tyrosine, lysine, methionine, arginine, glutamine, or cysteine. 
Preferably, the fusion protein has the sequence shown in SEQ ID 
No:6. 

25 The present invention is also directed to a 

recombinant vector comprising a DNA sequence disclosed herein, 
wherein said recombinant vector is capable of directing 
expression oif said DNA sequence for said fusion protein in a 
suitable host organism. The present invention is also directed to a 
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host organism containing a recombinant vector disclosed herein » 
wherein said organism is capable of expressing said fusion 
protein. 

The present invention is also directed to a method for 
5 purifying the fusion protein disclosed herein, comprising the steps 
of: 

contacting a protein sample containing said fusion protein in a 
mixture with other proteins with a metal chelate resin under 
conditions where said fusion protein binds to said resin to produce 

10 a resin-fusion protein complex; washing said resin-fusion protein 
complex with a buffer to remove said other, unbound proteins; 
and eluting said bound fusion protein from the washed resin- 
fusion protein complex; wherein said eluted fusion protein is 
purified. This method may further comprise the step of cleaving 

15 said protein of interest from said affinity peptide. Moreover, this 
method further comprises the step of separating said cleaved 
protein of interest from said affinity peptide using a. metal 
chelate resin under conditions where said affinity peptide binds to 
said metal of said resin and said protein of interest does not 

20 Any patents or publications mentioned in this 

specification are indicative of the levels of those skilled in the art 
to which the invention pertains. These patents and publications 
are herein incorporated by reference to the same extent as if each 
individual publication was specifically and individually indicated 

25 to be incorporated by reference. 

One skilled in the art will readily appreciate that the 
present invention is well adapted to carry out the objects and 
obtain the ends and advantages mentioned, as well as those 
inherent tiierein. The present examples along with tiie methods, 

20 
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procedures, treatments, molecules, and specific compounds 
described herein are presently representative of preferred 
embodiments, are exemplary, and are not intended as limitations 
on the scope of the invention. Changes therein and other uses will 
5 occur to those skilled in the art which are encompassed within the 
spirit of the invention as defined by the scope of the claims. 
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WHAT IS CLAIMED IS: 

1. A fusion protein comprising: a protein of interest 
fused at its amino terminus or carboxy terminus to at least one 

5 affinity peptide, said fusion protein having a formula Rl-(HXn)m- 
R2, wherein Rl or R2 is said protein of interest, H is histidine, X is 
an amino acid other than histidine, n= 1-8, m= 2-30, and wherein 
if n=l for more than two adjacent units of HX,at least one Xmust 
be asparagine, phenylalanine, tryptophan, tyrosine, lysine, 

10 methionine, arginine, glutamine, or cysteine. 



2. The fusion protein of claim 1, wherein n=l-4. 



15 



3. The fusion protein of claim 1, wherein m=3-10. 



4. The fusion protein of claim 1, wherein n=l-4 and 



20 m=3-6. 



5. The fusion protein of claim 1, wherein if n=l for 
more than two adjacent units of HX. only one X is asparagine, 
25 phenylalanine, tryptophan, tyrosine, lysine, methionine, arginine, 
glutamine, or cysteine. 
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6. The fusion protein of claim 2, wherein said 
fusion protein has a sequence SEQ ID No. 6. 



5 7. The fusion protein of claim 1, wherein said 

fusion protein contains at least one protease cleavage site between 
said protein of interest and said afGnity peptide. 

JO 8. The fusion protein of claim 1, wherein said 

affinity peptide has affinity for metal ions. 

9. The fusion protein of claim 8, wherein said metal 
IS ions are nickel ions. 

10. The fusion protein of claim 1, wherein said 
fusion protein further comprises a secretion signal sequence. 

20 

11. A DNA sequence coding for a fusion protein 
comprising a protein of interest fused at its amino- or carboxy- 
terminus to at least one affinity peptide, said fusion protein 

25 having the formula R1-(HX„)„-R2, wherein Rl or R2 is said 
protein of interest, n= 1-8, m= 2-30, and wherein if n=l for more 
than two adjacent units of HX,at least one Xmust be asparagine, 
phenylalanine, tryptophan, tyrosine, lysine, methionine, arginine, 
gltttamine, or cysteine. 
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12. The DNA sequence of claim 11. wherein said 
fusion protein has the sequence shown in SEQ ID No:l. 

5 

13. A recombinant vector comprising a DNA 
sequence of claim 11, wherein said recombinant vector is capable 
of directing expression of said DNA sequence for said fusion 

10 protein in a suitable host organism. 

14. A host organism containing a recombinant vector 
of claim 13, wherein said organism is capable of expressing said 

15 fusion protein, 

15. A method for purifying the fusion protein of 
claim 1, comprising the steps of: 

20 contacting a protein sample containing said fusion 

protein in a mixture with oAer proteins with a metal chelate resin 
under conditions where said fusion protein binds to said resin to 
produce a resin-fusion protein complex; 

washing said resin-fusion protein complex with a 
25 buffer to remove said other, unbound proteins; and 

eluting said bound fusion protein from the washed 
resin-fusion protein complex; wherein said eluted fusion protein is 
purified. 
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16. The method of claim 15, further comprising the 
step of cleaving said protein of interest from said affinity peptide. 

5 

17. The method of claim 16, further comprising the 
step of separating said cleaved protein of interest from said 
affinity peptide using a. metal chelate resin under conditions 
where said affinity peptide binds to said metal of said resin and 

10 said protein of interest does not. 
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SEQUENCE LISTING 

<110> TChaga, Grigoriy 

Jokhadze, George G. 

<120> conpositions and Methods for Protein Purification 
Based on a Novel Metal Ion Affinity Site 

<130> D6094PCT 

<140> 

<141> 1999-05-14 

<150> US 09/078,687 

<151> 1998-05-14 

<160> 6 



<210> 1 
<211> 840 
<212> ENA 

<213> artificial sequence 
<220> 

<223> Partial cDNA sequence of a vector containing the 

affinity peptide at the N-terminus of Green 
Fluorescent Protein-UV mutant (GFPuv) 

<400> 1 

atgaccatga ttacgccaag cttgtctctc aaggatcatc tcatccacaa 50 
tgtccacaaa gaggagcacg ctcatgccca caacaagatc agcgtggttg 100 
gtgtgggtgc agttggaccg gtaagtaaag gagaagaact tttcactgga 150 
gttgtcccaa ttcttgttga attagatggt gatgttaatg ggcacaaatt 200 
ttctgtcagt ggagagggtg aaggtgatgc aacatacgga aaacttaccc 250 
ttaaatttat ttgcactact ggaaaactac ctgttccatg gccaacactt 300 
gtcactactt tctcttatgg tgttcaatgc ttttcccgtt atccggatca 350 
tatgaaacgg catgactttt tcaagagtgc catgcccgaa ggttatgtac 400 
aggaacgcac tatatctttc aaagatgacg ggaactacaa gacgcgtgct 450 
gaagtcaagt ttgaaggtga tacccttgtt aatcgtatcg agttaaaagg 500 
tattgatttt aaagaagatg gaaacattct cggacacaaa ctcgagtaca 550 
actataactc acacaatgta tacatcacgg cagacaaaca aaagaatgga 600 
atcaaagcta acttcaaaat tcgccacaac attgaagatg gatccgttca 650 
actagcagac cattatcaac aaaatactcc aattggcgat ggccctgtcc 700 
ttttaccaga caaccattac ctgtcgacac aatctgccct ttcgaaagat 750 
cccaacgaaa agcgtgacca catggtcctt cttgagtttg taactgctgc 800 
tgggattaca catggcatgg atgagctcta caaataatga 840 
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<210> 



2 



<211> 



278 



<212> 



PRT 



<213> 



aortificial sequence 



<220> 



<223> 



Amino acid sequence encoded by pgurtial cDNA sequence 



of a vector containing the affinity peptide at the 
N-terminus of Green Fluorescent Protein-UV mutant (GFPu- 



Ifet Thr Met He Thr Pro Ser Leu Ser Leu Lys Asp His Leu He 

5 10 15 

His Asn Val His Lys Glu Glu His Ala His Ala His Asn Lys He 

20 25 30 

Ser Val Val Gly Val Gly Ala Val Gly Pro Val Ser Lys Gly Glu 

35 40 45 

Glu Leu Phe Ihr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 

50 55 60 

A^ Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly 

65 70 75 

Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

80 85 90 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Ser 

95 100 105 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 

110 115 120 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

125 130 135 

Arg Thr He Ser Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 

140 145 150 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 

155 160 165 

lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys 

170 175 180 

Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Thr Ala Asp 

185 190 195 

Lys Gin Lys Asn Gly He Lys Ala Asn Phe Lys He Arg His Asn 



<400> 



2 



200 



205 



210 
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lie Glu Asp 


Gly Ser Val Gin Leu Ala 


Asp 


His Tyr Gin Gin Asn 




215 


220 


225 


Uir Pro lie 


Gly Asp Gly Pro Val Leu 


Leu 


Pro Asp Asn His Tyr 




230 


235 


240 


Leu Ser T!hr 


Gin Ser AXa Lieu £>er ijys 




Pro Asn Glu Lvs Ara 




245 


250 


255 


Asp His Met 


Val Leu Leu Glu Phe Val 


Thr 


Ala Ala Gly lie Thr 




260 


265 


270 


xixs \9XY 


Aca^ Glu Leu TVX LVS 








275 






<210> 


3 






<211> 


3384 






<212> 


DNA 






<213> 


artificial sequence 







<220> 

<223> CENA sequence of a vector containing the construct 

of the affinity peptide at the N- terminus of GFPuv 
<400> 3 

agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa 50 

tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 100 

cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact 150 

ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt 200 

cacacaggaa acagctatga ccatgattac gccaagcttg tctctcaagg 250 

atcatctcat ccacaatgtc cacaaagagg agcacgctca tgcccacaac 300 

aagatcagcg tggttggtgt gggtgcagtt ggaccggtaa gtaaaggaga 350 

agaacttttc actggagttg tcccaattct tgttgaatta gatggtgatg 400 

ttaatgggca caaattttct gtcagtggag agggtgaagg tgatgcaaca 450 

tacggaaaac ttacccttaa atttatttgc actactggaa aactacctgt 500 

tccatggcca acacttgtca ctactttctc ttatggtgtt caatgctttt 550 

cccgttatcc ggatcatatg aaacggcatg actttttcaa gagtgccatg 600 

cccgaaggtt atgtacagga acgcactata tctttcaaag atgacgggaa 650 

ctacaagacg cgtgctgaag tcaagtttga aggtgatacc cttgttaatc 700 

gtatcgagtt aaaaggtatt gattttaaag aagatggaaa cattctcgga 750 

cacaaactcg agtacaacta taactcacac aatgtataca tcacggcaga 800 

caaacaaaag aatggaatca aagctaactt caaaattcgc cacaacattg 850 

aagatggatc cgttcaacta gcagaccatt atcaacaaaa tactccaatt 900 

ggcgatggcc ctgtcctttt accagacaac cattacctgt cgacacaatc 950 
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tgccctttcg aaagatccca acgeiaaagcg tgaccacatg gtccttcttg 1000 
agtttgtaac tgctgctggg attacacatg gcatggatga gctctacaaa 1050 
taatgaattc caactgagcg ccggtcgcta ccattaccaa cttgtctggt 1100 
gtcaaaaata ataggcctac tagtcggccg tacgggccct ttcgtctcgc 1150 
gcgtttcggt gatgacggtg aaaacctctg acacatgcag ctcccggaga 1200 
cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag 1250 
ggcgcgtcag cgggtgttgg cgggtgtcgg ggctggctta actatgcggc 1300 
atcagagcag attgtactga gagtgcacca tatgcggtgt gaaataccgc 1350 
acagatgcgt aaggagaaaa taccgcatca ggcggcctta agggcctcgt 1400 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga 1450 
cgtcaggtgg cacttttxrgg ggaaatgtgc gcggaacccc tatttgttta 1500 
tttttctaaa tacattcaeia tatgtatccg ctcatgagac aataaccctg 1550 
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt 1600 
tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt 1650 
gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 1700 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg 1750 
agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt 1800 
ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 1850 
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag 1900 
tcacagaaaa gcatcttacg gatggcatga cagtaagaga attatgcagt 1950 
gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 2000 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc 2050 
atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca 2100 
aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 2150 
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa 2200 
tagactggat ggaggcggat aaagttgcag gaccacttct gcgctcggcc 2250 
cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 2300 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta 2350 
tcgtagttat ctacacgacg gggagtcagg caactatgga tgaacgaaat 2400 
agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 2450 
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt 2500 
aatttaaaag gatctaggtg aagatccttt ttgataatct catgaccaaa 2550 
atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 2600 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct 2650 
tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 2700 
gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 2750 
accaaatact gtccttctag tgtagccgta gttaggccac cacttcaaga 2800 
actctgtagc accgcctaca tacctcgctc tgctaatcct gttaccagtg 2850 
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gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 2900 
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca 2950 
cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag 3000 
cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 3050 
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc 3100 
cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc 3150 
tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 3200 
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc 3250 
cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac 3300 
cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 3350 
cgagcgcagc gagtcagtga gcgaggaagc ggaa 3384 

<210> 4 
<211> 2754 
<212> DNA 

<213> artificial sequence 

<220> 

<223> cDNA sequence of a vector containing part of tbe 

affinity p^tide at the N-terminus of enterokinase 
cleavage site 
<400> 4 

gacgaaaggg cctcgtgata cgcctatttt 

atEUitggttt cttagacgtc aggtggcact 

aacccctatt tgtttatttt tctaaataca 

tgagacaata accctgataa atgcttcaat 

atgagtattc aacatttccg tgtcgccctt 

ttgccttcct gtttttgctc acccagaaac 

ctgaagatca gttgggtgca cgagtgggtt 

agcggtaaga tccttgagag ttttcgcccc 

gagcactttt aaagttctgc tatgtggcgc 

ccgggcaaga gcaactcggt cgccgcatac 

gttgagtact caccagtcac agaaaagcat 

aagagaatta tgcagtgctg ccataaccat 

acttacttct gacaacgatc ggaggaccga 

cacaacatgg gggatcatgt aactcgcctt 

gaatgaagcc ataccaaacg acgagcgtga 

tggcaacaac gttgcgcaaa ctattaactg 



tataggttaa 


tgtcatgata 


50 


tttcggggaa 


atgtgcgcgg 


100 


ttcaaatatg 


tatccgctca 


150 


aatattgaaa 


aaggaagagt 


200 


attccctttt 


ttgcggcatt 


250 


gctggtgaaa 


gtaaaagatg 


300 


acatcgsiact 


ggatctc£iac 


350 


geiagaacgtt 


ttccaatgat 


400 


ggtattatcc 


cgtattgacg 


450 


actattctca 


gaatgacttg 


500 


cttacggatg 


gcatgacagt 


550 


gagtgataac 


actgcggcca 


600 


aggagctaac 


cgcttttttg 


650 


gatcgttggg 


aaccggagct 


700 


caccacgatg 


cctgtagcaa 


750 


gcgaactact 


tactctagct 


800 
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tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 850 
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900 
gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat 950 
ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac 1000 
tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 1050 
agcattggta actgtcagac caagtttact catatatact ttagattgat 1100 
ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 1150 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 1200 
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg 1250 
cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt 1300 
ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 1350 
tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta 1400 
ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 1450 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500 
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga 1550 
acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga 1600 
actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag 1650 
ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag 1700 
cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 1750 
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800 
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc 1850 
ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc 1900 
tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 1950 
gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa 2000 
gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 2050 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100 
acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac 2150 
tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 2200 
tcacacagga aacagctatg accatgatta cgccaagctt gaaggatcat 2250 
ctcatccaca atgtccacaa agaggagcac gctcatgccc acaacaagat 2300 
cgatgacgat gacaaagtcg acggatcccc gggtaccgag ctcgtaatta 2350 
gctgagaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc 2400 
tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 2450 
ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc 2500 
agcctgaatg gcgaatggcg cctgatgcgg tattttctcc ttacgcatct 2550 
gtgcggtatt tcacaccgca tatggtgcac tctcagtaca atctgctctg 2600 
atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg 2650 
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac 2700 
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cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac 2750 

2754 

gcgc 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 



<400> 
Met Thr Met 

Val His Lys 

Asp Asp Lys 



5 

45 
PRT 

artificial sequ««:e 

Amino acid sequence encoded by cDNA corresponding 
to the start of translation site, the affinity 
peptide and the multiple cloning site 
5 

lie Thr Pro Ser Leu Lys Asp His Leu lie His Asn 
5 10 15 

Glu Glu His Ala His Ala His Asn Lys He Asp Asp 

20 25 30 

Val Asp Gly Ser Pro Gly Thr Glu Leu Val He Ser 

35 40 45 



<210> 6 
<211> 32 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Amino acid sequence of fusion protein «diere n=l-4 

for the affinity p^tide 
<400> 6 

Ser Leu Lys Asp His Leu He His Asn Val His Lys Glu Glu His 
5 10 15 

Ala His Ala His Asn Lys He Ser Val Val Gly Val Gly Ala Val 
20 25 30 

Gly Met 
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